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Name 


Description 


VM Interpreter 


Converts FORTH words (Strings) to FORTH tokens and literals 


VM Engine 


Executes FORTH tokens and literals 


Script Translator 


Converts scripts written in "the grammar of this invention" to FORTH words 


Automata Composition 
Engine 


Composes modeling NFAs from FORTH regular expressions and converts NFAs 
to executable NFAs via subset-construction 


Automata Execution 
Engine 


Executes an NFA against the input stream, and returns side-effects as compiled 
FORTH tokens 
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VM 


VM Engine 


Script Translator 


Automata 


Automata 


Interpreter 






Composition Engine 


.Execution engine 


EParser.java 


Engine.java 


Parser.java 


AnyGrammar.j ava 


AcceptState .j ava 


Interpreter.java 


E Array java 


Script.java 


Arcjava 


Automata.j ava 


Console.java 


E Array Re f.java 


ScriptParser.j ava 


ArcList.java 


FastArc.java 


Parser .java 


EAtom.java 


Stream.java 


ArcNums .j ava 


FastGraph.j ava 




hDict.java 


Token.java 


ButGranunar.java 


FastNode.java ■ 




EFrame.java 


1 okenlStream.java 


ButNotGrammar.j ava 


IntQueueHeapjava 




EInstr.java 


SUnit.java 


CharClass .java 


SimpleCharMap.java 




ELiteral.java 


i> u nit_r> iocko uTit.j ava 


CharClassGrammar.java 






EObjects.java 


IS U nit_break5>tmt.java 


CharGrammar . j ava 






hStack.java 


o u nit_continueotmt.j ava 


ConcatGramniar.j ava 






EStream.java 


SUnitDeclatmt.java 


ContainsGrammar. j ava 






EType.java 


SUnitDoStmt.java 


DoGrammar.j ava 






OutStream.java 


SUnit_EmptyStmt.java 


Grammar.java 






Stream.java 


SUnit_Expr.j ava 


Graph.java 






Range.java 


SUnit_ExprStmt.java 


InhibitGrammarjava 








b U nit_F orbtrntj ava 


Instrs.java 








SUnit_Frameable.java 


IterateGrammar.j ava 








SUnitFuncDecl .j ava 


Node .java 








b U nit_lilS tmt.j ava 


NodeSet.java 








S U mt_rrint!S tmtj ava 


NullGraniniar.java 








IS Umt_Keturn:>tmt.java 


OptionalGrammar.java 








IS U iutKuleDeclz .j ava 


Rej ectGrammar.java 








SUnitbtmLjava 


RepeatGramniar.java 








SUnit_StmtList.java 


Segment.java 








SUnitTokenizeStrntjava 


StringGrammar.j ava 








SUmtWmleStmt.j ava 


SubsetNode.java 








l ypeL/cci.java 


1 Ilipivjralllllldi .Java 








VarDecl.java 


Transition.java 








VarDeclScopejava 


TransitionArray.java 








VarDeclStackjava 


TransitionGroup.java 










UnionGrammar.java 










UpCountjava 
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Operators 


Purpose 


Associativity 


* 


iterate and repeat 


left-to-right 


+ >+ <+ 


left-to-right and right-to-left 
concatenation 


left-to-right 


1 


union 


left-to-right 




conditional 


right-to-left 


but butnot 


subjunctive expressions 


left-to-right 
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FORTH word 


Stack operand(s) 


Stack result(s) 


Description 


{xx 


- 


- 


begins compilation of an executable 
instr-array for do-patterns 


xx} 






ends compilation of an executable 
instr-array for do-patterns 


nop 






the "nop" instruction 


pdo 


x-array-obj pat-obj x-array-obj 


pat-obj 


the low-level "do-pattern" 


"AB" 




"AB" 


pushes the literal "AB" onto the 
FORTH stack 


String.Pattern 


str-obj 


pat-obj 


converts a FORTH string to a 
FORTH pattern object 
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FORTH word 


Stack operand(s) 


Stack result(s) 


Description 


swap 


objl obj2 


obj2 objl 


swaps top elements on FORTH stack 


char.Pattern 


char-obj 


pat-obj 


converts top element of FORTH stack 
from character object to Pattern object 


CharClass.Pattern 


charclass-obj 


pat-obj 


converts top element of FORTH stack 
from char-class to pattern 


char.String 


char-obj 


str-obj 


converts top element of FORTH stack 
from char to string 


P? 


pat-obj 


pat-obj 


modifies top Pattern on FORTH stack 
via unary "optional" operator 


Pi 


pat-obj 1 pat-obj2 


pat-obj 


creates a new pat-obj which is a union of 
the 2 patterns at top of stack 


s+ 


str-objl str-obj2 


str-obj 


creates a new str-obj which is a 
concatenation of 2 strings at top of stack 



Figure 7 



r UK in word 


rSetore stack 


Alter stack 


A A^A«/ A \ 

Action(s) 


-> 




varO-ref 


Push reference to $0 local variable 


ir a nit 

-> [AB] 
CharClass.Pattern 




pat-obj 


Construct/push [AB] as 
CharClassGrammar object 


-> assign 


varO-ref pat-obj 




Assign [AB] object to $0 local variable 


-> $1 




varl-ref 


Push reference to $1 local variable 


-> $0 valueof 


- 


varO-pat-obj 


Push value of $0 local variable -> 
'[AB] 1 


-> assign 


varl-ref varO-pat-obj 




Assign '[AB]* object to $1 local variable 


-> $0 




varO-ref 


Push reference to $0 local variable 


-> $0 valueof 




varO-pat-obj 


Push value of $0 local variable -> 
'[AB]' 


-> p? 


pat-obj 


pat-obj 


Construct/push an OptionalGrammar 
object -> ? f [AB]' 


-> $0 valueof 




varO-pat-obj 


Push value of $0 local variable -> 
'[AB]' 


-> p? 


pat-obj 


pat-obj 


Construct/push an OptionalGrammar 
object -> ?'[AB]' 


-> p<+ 


pat-obj 1 pat-obj2 


pat-obj 


Pop 2 grammar objects, construct a new 
ConcatGrammar object, and push -> 

?'[AB]' <+ ?'[AB] f 


-> swap 


varO-ref pat-obj 


pat-obj vaiO-ref 


Swap the reference and grammar objects 
on stack 


-> 1 get 


pat-obj varO-ref 


pat-obj varO-ref pat- 
obj 


Push another reference to grammar 
object 


-> assign 


varO-ref pat-obj 


- 


Assign $0 to grammar obj -> 
?'[AB]'<+ ?'[AB]' 


-> drop 


pat-obj 




Drop top element 


-> $2 




var2-ref 


Push a reference to $2 local variable 


-> SO valueof 




varO-pat-obj 


Get value of $0 variable -> 
? , [AB] , <+ ?'[AB]' 


-> $1 valueof 




varl -pat-obj 


Get value of $1 variable -> f [AB] f 


-> p+ 


pat-obj 1 pat-obj2 


pat-obj 


Pop top 2 grammar objs, construct new 
obj -> (?'[AB]' <+ ?'[ABR + '[AB]' 


-> assign 


var2-ref pat-obj 




Assign above grammar object to 
variable $2 
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Contemplated 


Evolved to 


Reason 


['a', V, ! o\:9T 


'[abO-9]' 


Given in section 8.3.5.1 


anyother 


butwithout 

jutnot 

Dutalso 


anyother cannot be modeled effectively as an 
NFA. The subjunctive form properly 
generalizes the negated character-class. 


butwithout 

butnot 

*)utalso 


but 
3utnot 


DUt and butnot are more accurately converses 
of each other; butwithout is just a design 
pattern based on butnot 


righttoleft 
lefttoright 


<+ 
>+ 


The <+ and >+ variations of the default + 
concatenation operator are more compatible 
with a C-style interpreter and C-style 
precedence/associativity rules. 


P*N1*N2 


P*N1..N2 


Better to introduce a range type (and a 
binary range .. operator; ana 10 ovenoau a 
binary * operator that already exists in the C 
operator-list, than to introduce a new tertiary 
operator form. 


reject 
accept 


reject 


accept cannot be modeled as an NFA (its 
meaning is context dependent); there are 2 
possible interpretations to the meaning of 
accept; no use cases that demonstrate the 
ability to solve an otherwise unsolvable 
problem. 


"unary after" 


"unary before" 


z « as unary operators niubt lumc uciuic tu 
be C expression syntax compatible 


PI otherwise P2 otherwise P3 




TVi-in /vrommoi* V»OC \7olllO r\llt tint 1T1 tn 1 C TAI'Ill 

inis grammar nas vaiue dui iiol ui una luiui 
- will be part of a continuation patent. 


choice<index>(Pattern[]) 


- 


The "dynamic switch" can be modeled by 
line-by-line composition, leveraging a 
feature of instantiated production rules - see 
example tmpl6.tok of section 8.4.2.4 


concat<Pattem> (Pattern) 




postponed 


ignorecase<boolean> 
(Pattern) 




not novel, postponed 1 


"back-referencing" feature 




postponed 


"recursion" support 




Postponed because of complications 
surrounding the subjunctive, which is 
implemented through subset construction. 



